Update the cache row dim calculation in TBE SSD #4480

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Closed

q10 wants to merge 1 commit into pytorch:main from q10:export-D77321062

Contributor

q10 commented Jul 12, 2025

Summary: - The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions. This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

netlify bot commented Jul 12, 2025 •

edited

Loading

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`be097b6`
🔍 Latest deploy log	https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/687574445028420008b7d13b
😎 Deploy Preview	https://deploy-preview-4480--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

facebook-github-bot added the cla signed label

Contributor

facebook-github-bot commented Jul 12, 2025

This pull request was exported from Phabricator. Differential Revision: D77321062

facebook-github-bot added the fb-exported label

q10 added a commit to q10/FBGEMM that referenced this pull request


          Update the cache row dim calculation in TBE SSD (pytorch#4480)

8e9bcfb

Summary:

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions.  This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

q10 force-pushed the export-D77321062 branch 2 times, most recently from 8e9bcfb to 656d0ba Compare

July 14, 2025 19:17

q10 added a commit to q10/FBGEMM that referenced this pull request


          Update the cache row dim calculation in TBE SSD (pytorch#4480)

656d0ba

Summary:

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions.  This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

Contributor

facebook-github-bot commented Jul 14, 2025

This pull request was exported from Phabricator. Differential Revision: D77321062

1 similar comment

Contributor

facebook-github-bot commented Jul 14, 2025

This pull request was exported from Phabricator. Differential Revision: D77321062

q10 added a commit to q10/FBGEMM that referenced this pull request


          Update the cache row dim calculation in TBE SSD (pytorch#4480)

be04ecf

Summary:
Pull Request resolved: pytorch#4480

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions.  This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

q10 force-pushed the export-D77321062 branch 2 times, most recently from be04ecf to 04209be Compare

July 14, 2025 20:30

q10 added a commit to q10/FBGEMM that referenced this pull request


          Update the cache row dim calculation in TBE SSD (pytorch#4480)

04209be

Summary:

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions.  This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

q10 added a commit to q10/FBGEMM that referenced this pull request


          Update the cache row dim calculation in TBE SSD (pytorch#4480)

ee4adb1

Summary:

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions.  This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

q10 force-pushed the export-D77321062 branch from 04209be to ee4adb1 Compare

July 14, 2025 20:31

Contributor

facebook-github-bot commented Jul 14, 2025

This pull request was exported from Phabricator. Differential Revision: D77321062

q10 force-pushed the export-D77321062 branch from ee4adb1 to 696c39e Compare

July 14, 2025 20:34

q10 added a commit to q10/FBGEMM that referenced this pull request


          Update the cache row dim calculation in TBE SSD (pytorch#4480)

696c39e

Summary:
Pull Request resolved: pytorch#4480

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions.  This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

Contributor

facebook-github-bot commented Jul 14, 2025

This pull request was exported from Phabricator. Differential Revision: D77321062

q10 added a commit to q10/FBGEMM that referenced this pull request


          Update the cache row dim calculation in TBE SSD (pytorch#4480)

ba9cfe9

Summary:
Pull Request resolved: pytorch#4480

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions.  This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

q10 force-pushed the export-D77321062 branch from 696c39e to ba9cfe9 Compare

July 14, 2025 20:49


          Update the cache row dim calculation in TBE SSD (pytorch#4480)

be097b6

Summary:

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that optimizers have state sizes that are fixed relative to table dimensions.  This change updates the cache row dim calculation to account for optimizers whose states' sizes depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062

q10 force-pushed the export-D77321062 branch from ba9cfe9 to be097b6 Compare

July 14, 2025 21:18

Contributor

facebook-github-bot commented Jul 14, 2025

This pull request was exported from Phabricator. Differential Revision: D77321062

facebook-github-bot closed this in

619b6ab

facebook-github-bot added the Merged label

Contributor

facebook-github-bot commented Jul 15, 2025

This pull request has been merged in 619b6ab.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla signed fb-exported Merged